BOUNDS FOR BINARY CODES RELATIVE TO PSEUDO-DISTANCES OF k POINTS 



CHRISTINE BACHOC, GILLES ZEMOR 



ABSTRACT. We apply Schrijver's semidefinite programming method to obtain improved upper bounds 
on generalized distances and list decoding radii of binary codes. 



1. Introduction 

Let H n := FJ? denote the binary Hamming space, endowed with the Hamming distance. One of 
the longstanding problems of coding theory is to find estimates for the maximum cardinality A(n, d) 
of a code C C H n with the constraint that the Hamming distance of any pair of distinct elements 
of C is at least equal to d. The best known upper bound for A(n, d) is obtained with the so-called 
linear programming method, due to Philippe Delsarte, and is the optimal value of a linear program (LP 
for short) (|9), (8, Chapter 9]). Because linear programs come with efficient algorithms, this method 
yields good numerical bounds for given parameters (n, d). Moreover, close to optimal explicit feasible 
solutions have been found from which upper bounds in the form of explicit functions of n and d have 
been derived lfl2l . as well as an upper bound in the asymptotic range [13]. After these significant 
achievements, the subject fell into a period of about twenty years during which nothing really new 
was discovered, until A. Schrijver in [ 15 ] obtained improved upper bounds on A(n, d) for some small 
values of the parameters (n, d), using semidefinite programming. Although these improvements are 
numerically not all that impressive, the method behind them introduces genuinely new ideas. In order 
to explain them, it is good to go back to Delsarte's method. Let us recall that the variables of the 
Delsarte linear program represent the distribution of the Hamming distance in the constrained code. 
More precisely, let 

x i ITTn card {(>, y) G C 2 : d(x, y) = i}. 

card(C ) 

Then the main idea is to observe that these variables satisfy certain linear inequalities, the non trivial 
ones being related to the Krawtchouck polynomials K%(x), namely, for < k < n, 



n 

E 

i=0 



K%(i)xi > 0. 



Schrijver's new idea IPT51 is to exploit constraints on triples of points (x, y, z) G C 3 rather than 
deal only with pairs. It turns out that the natural constraints are semidefinite positive (SDP) instead of 
linear. The variables of the program are 

1 „ 

x a ,b,c ■= jtttt card{(x, y,z) € C : d(y, z) = a, d(x, z) = b, d(x, y) = c} 

card(C J 

and the SDP constraints take the form 

y^Xa,b,cS(a, b,c) >z 0, 
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where >z stands for "is positive semidefinite", for some symmetric matrices S(a, b, c). These SDP 
constraints are closely related to the action of the symmetric group S n on the functional space M^ 71 ; 
more precisely, each S n -irreducible module occuring in R Hn gives rise to an SDP inequality with 
matrices of size equal to its multiplicity. It should be noted that the full group of automorphisms 
Aut(i/ n ) acts multiplicity free on the same space R Hn , and that it is the true reason why in the case 
of the Delsarte method, the constraints are linear. 

The aim of the present paper is to show that the Schrijver method can be used not only to strengthen 
the LP bounds, but also to give bounds for other problems, to which the LP method does not apply. 
Indeed, in recent years several generalizations of the Hamming distance, in the form of functions 
(we will call them pseudo-distances) of k > 3 elements of H n have attracted attention. We consider 
here three such functions f(x\, . . . , Xk), namely the generalized Hamming distance d(x\, . . . , xj.), 
the radial distance r(x\, . . . , Xk) and the average radial distance r(x\, . . . , x^)- They share the crucial 
property of being invariant by the action of the automorphism group Aut(# n ) of the Hamming space. 

The generalized Hamming weights of linear codes were introduced by Ozarow and Wyner [ |T4l in 
view of cryptographic applications related to the so-called wire-tap channel. The concept was later 
made popular for its own sake by Wei |[T9ll . The notion was extended to the non linear setting in [7] 
in order to derive bounds on generalized weights. The generalized Hamming distance d(x\, . . . , x^) 
of k points is the number of coordinates where the k points are not all equal. Thus d(x\, X2) is the 
classical Hamming distance. In Q, the authors derive bounds for generalized distances, focusing on 
asymptotics, which are analogs of the classical Hamming, Plotkin and Elias-Bassalygo bounds. In the 
case of linear codes the best known asymptotic upper bounds were obtained in [H. 

The radial distance and the average radial distance are related to the notion of list decoding. The 
radial distance or radius of k elements is the smallest radius of a Hamming ball that contains the k 
points. If a code C has the property that the radius of any /c-tuple of pairwise distinct points is at least 
equal to some value r, then any ball of the Hamming space of radius r — 1 intersects C in at most 
k — 1 points. Thus a decoding procedure that outputs every codeword at distance at most r — 1 of 
any given received vector yields a list of codewords of cardinality at most k — 1. The search for large 
codes with given minimum -radius is also studied in the litterature as the quest for dense multiple 
packings: indeed, a code of minimal /c-radius r provides a packing of balls (centered at the codewords, 
of radius r — 1) such that any element of H n belongs to at most k — 1 balls. These notions have a 
long standing history, going back to problems in Euclidean geometry and to early coding theory. They 
came back into the limelight some ten years ago when Sudan discovered his now famous algorithm 
for list decoding of Reed-Solomon codes lfl6l . Blinovskii Q establishes asymptotic bounds on the 
maximal number of elements of a code with given minimal radius: in the process he defines an 
auxiliary quantity, the average radius of k elements that we will also investigate. 

In general, we are given a function / from il™ into the set of non-negative integers, and we denote 
by Ak-i (n, f, to) the maximal number of elements that a binary code C can have under the constraint 
that f(x\, . . . , x^ > m for every /c-tuple of pairwise distinct codewords. Our goal is to show that the 
SDP method gives good upper bounds for ^(n, /, m) for modest values of n, when compared with 
the classical bounds. Our results provide strong motivation for the development of the SDP method, 
which is far from being at the same stage of achievement as the LP method. 

The paper is organized as follows: Section 2 provides a description of the orbits of Aut(H n ) 
acting on This preliminary task is essential since the pseudo-distances we are dealing with only 
depend on these orbits. Section 3 recalls the definitions and basic properties of the three particular 
functions we consider. Section 4 defines the code invariants associated to these functions and recalls 
their significance for applications. Section 5 settles the "classical" bounds. These bounds already 
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appear in the litterature (Q, 0, (6l) but not in the precise form needed here: either they are settled 
only for linear codes, or the concern is in the asymptotic setting and they are not as tight as they 
can be for small parameters. Section 6 recalls the SDP method of [15] using the language of group 
representation, i.e. following (31, iflTl . lfl8ll . Section 7 provides some numerical results. 

2. The orbits of Aut(H n ) acting on H k 

The automorphism group of the binary Hamming space H n := ¥ 2 , denoted by Aut(H n ), is the 
semi-direct product of the group of translations by elements of H n with the group of permutations on 
the n coordinates. The group Aut(H n ) acts two-point homogeneously on H n , which means that the 
orbits of Aut(if n ) acting on H% are characterized by the Hamming distance. In other words 

{x, y) ~Aut(ff„) (x, y) d(x, y) = d(x', y'). 
Here (x, y) ~Aut(if n ) y') stands for: there exists g G Aut(H n ) such that g(x) = x' and g{y) = y' . 
We want to study the action of Aut(ff n ) on fc-tuples (x\, . . . , Xk) G H%. We introduce: 

Definition 2.1. For x = (x±, . . . , Xk) € il„, and for u G F|, let 

n u (x) := card{ j, 1 < j < n : ((xi) j} (x k )j) = u} 
and let the "weight distribution" ofx be defined by 

W(x) := {n u {x)) u& k. 

For u G F|, the word obtained from u by flipping zeros and ones, will be denoted by u. In other words 
u = u+ l k . One of{u,u} has the form Ow with w G F* -1 . Let 

n w (x) := n 0w (x) + nm(x). 

The "symmetrized weight distribution " ofx is defined by: 

W(x) := (n w (x)) w& k-i 

Remarks: 

(i) It is nice to identify x with the (k, n) matrix M(x) whose i-th line equals xi. Then n u (x) is 
the number of columns of x which are equal to u: 

xi = 000 ... 

x 2 = 111 . . . 1 

M(x) =■_=■_ ; 

x k =111^_1 

n u (x) 

(ii) We have X^eF§ n u(x) = E^gF*" 1 = n - 
Proposition 2.2. 

x ~Aut(^„) V W(x) = W(y). 

Proof. It is clear that x ~Aut(i? n ) V ^ ^ y' where x[ = (0, xi — xi, . . . ,Xk — x\) and y' = 
(0, y 2 -yi,...,yk- yi). Then W(x') = W(y') iff W(x') = W(y') and is left unchanged if the 
coordinates are permuted. Conversely, for an appropriate permutation a of the coordinates, cr(x') has 
its columns reordered in lexicographic order. Another permutation r has the same effect on y'\ since 
W(a(x')) = W(r(y')). it means that a(x') = r(y')- ~ □ 
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Remark 2.3. 

(i) If k = 2, we have of course n\(x) = d(x±, x 2 ) and no(x) = n — n\(x). In the case k = 3, 
we have 

nio + nu = d(xi,x 2 ) 
mi + nio = d(x 2 ,x 3 ) 
n i +nn = d(x 3 ,xi) 

and the triple (d(xi,x 2 ),d(x 2 ,xs), d(x 3 ,xi)) uniquely determines the orbit of(xi, x 2 , £3). 

(ii) For arbitrary k, taking into account the relation n w = n, the orbits of Aut(H n ) on 
are described by 2 k ~ 1 — 1 independent parameters. In contrast, the orbits of k-tuples of 
elements of the unit sphere of the Euclidean space S n_1 under the action of the orthogonal 
group 0(R n ) need only (g) rea -l numbers in order to be uniquely determined, namely the 
pairwise inner products of the k vectors. The orbits of H% under Aut(H n ) are determined 
by the pairwise distances d(xi,Xj) only ifk = 2, 3. 

(iii) In the next section we introduce several functions /(a?i, . . . , Xk) such that 

f(a(xi), . . .,a(x k )) = f{xi, ...,x k ) 

for all a £ Aut(H n ). It follows from the above description of the orbits of H% that such 
functions have an expression of the form f{x\, . . . , x k ) = f(W(x)). 

3. Aut ( H n ) - IN VARIANT FUNCTIONS ON H*. 

3.1. The generalized Hamming distance. 

Definition 3.1. The generalized Hamming distance ofk elements of H n is defined by: 
d(xi, ... ,x k ) = card{j, 1 < j < n : card{(xi)j, . . . , (x k )j} > 2} 
= card{j, 1 < j < n : {( Xl )j, (x k )j) £ {0 k , l k }} 

Proposition 3.2. The following properties hold for the generalized Hamming distance: 

(i) d(xi,x 2 ) is the usual Hamming distance. 

(ii) For all permutation t of {1, . . . , k}, d(xi, . . . , x k ) = d(x T (i), . . . , x T ( k )). 

(iii) For all a € Aut(H n ), d(xi, . . . , x k ) = d(a(xi), . . . , a(x k )). The generalized distance 
d(xi, . . . , x k ) is related to the weight distribution by: 

d(xi, . . . ,x k ) = ^2 n w (x). 

(iv) d(xi, . . . , x k -i, x k ) = d(xi, . . . , x k -i) if x k belongs to the affine subspace generated by 
X\, . . . , x k — 1. 

(v) "Triangular" inequality: for all y € H n , 

1 k 

d(xi, ...,x k )< -j—^ ^2 d ( x i> • • • > x *-i> y> • • • ' x k)- 
1=1 

(vi) The distance of three points can be expressed in terms of pairwise Hamming distances: 

d(xi,x 2 ,x 3 ) = -(d(xi,x 2 ) + d(x 2 ,x 3 ) + d(x 3 , xi)). 
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(vii) For more than three points we have only the inequality: 

d(xi,x 2 ,...,x k ) < -— ^2 d(xi,Xj). 

l<i<j<k 

(viii) We also have the inequalities: 

1 k 

d(xi, ...,X k )< j—j ^ • • • > Xi-!,X i+ l, ...,x k ) 

i=l 

1 k 

d(xx, . . . , Xfc) > - y~] d(xi, ... , Xi-i,x i+ \, x k ). 



k 

i=l 

Proof. Properties (i), (ii), (iii) are obvious. 

If Xk belongs to the affine subspace generated by x\, . . . , x k -\, then we can write x k = J2iZ\ a i x i 
with J2i=i a i = 1- Consequentely, if • • • , (xk-\)j) = fc_1 , respectively then we have 

((xi)j, . . . , (xk)j) = fc , respectively l k . It follows (iv) that d(x\, . . . , Xk-i, Xk) = d(x\, . . . , Xk-i). 

The announced "triangular" inequality (v) is easily checked in the case n = 1. The general case 
follows from the fact that 

n 

(1) d(xi,...,x k ) = '^2d((x 1 ) j ,...,(x k )j). 

i=i 

Again because of (Q~|), it is enough to prove (vi) (vii) and (viii) for n = 1. In this case, let the 
Hamming weight of (x±, . . . , x k ) be denoted by w, then 

_ J 1 ifl<u;<fc-l 
d( Xl ,...,x k ) - | Q ifw = 0;A ,_ 

and _ 

d(xi,Xj) = w{k — w). 

i<i<j<k 

Obviously w(k — w) > k — 1 if w ^ 0, k and equals otherwise. Inequality (vii) follows. In the case 
k = 3, w(k — w) takes only the values and 2 hence (vi). 
To prove (viii), notice that we have 

k ( k if2<w<k-2 

y~]d(xi, . . . ,Xj-i,x i+ i, . . . ,x k ) = < k-1 ifw = l,k-l 
i=i [ iiw = 0,k 

hence the announced inequalities. □ 
3.2. The radial distance. 

Definition 3.3. The radial distance or radius of k elements of H n is defined by: 

r(x±, . . . , x k ) = min{r : there exists y £ H n s.t. {x\, . . . , x k } C B(y, r)} 

= min{ max d(y, Xj)}. 
y l<i<k 

Proposition 3.4. The radial distance has the properties: 

(i) r(x 1 ,x 2 )=\ d Mp±\ 

(ii) For all permutations r of {1, . . . , k}, r(xi, . . . , x k ) = r(x T ^, . . . , x T r k \). 

(iii) For all a G Aut(H n ), r(xi, ...,x k ) = r(a(xi), . . . ,a(x k )). 
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(iv) For all k, 

r(xx, ...,x k )> max r(xi, . . . , acj_i , x i+x , ■ ■ -,x k )- 

l<i<k 

(v) For k = 3, we have 

r(xi,x 2 ,x 3 ) = max{r(x 1 ,x 2 ),r(x2,x 3 ),r(x 3 ,xi)}. 

Proof. Properties (i) (ii) (iii) and (iv) are obvious. 

Let (xi,X2,x 3 ) € H%. Without loss of generality we can assume that d(x2,x 3 ) < d(x 3 ,x\) < 
d(xi,X2) and that x\ = 0. With the notation of section 1 it amounts to assume that n i < n 10 < n\\. 
Let y € H n be the center of a smallest ball containing the three words; clearly the coordinates of y at 
the positions corresponding to w = 00 in M(x) must be equal to 0. Let y w be the number of ones at 
the positions corresponding to w. We have: 

d(y,xi) = j/oi +Vw + 2/11 

d(y, x 2 ) = 2/01 + nio - yio + «u - yu 

d(y, x 3 ) = n 01 - 2/oi + 2/io + "11 - 2/H 



We choose y such that: 

2/oi = 

i "-oi + "io + 2nn 
2/11 = L ^ I < "ll 

i "-io - "01-, , 
2/io = L ^ I < "io 

Then one easily verifies that for i = 1,2,3, d(y,Xi) < |" ni0+w 11 ] thus the ball centered at y with 
radius |" ni0+ni1 ] contains the three words x\,X2,x 3 . Since 

"io + "n = d(xi,x 2 ) = max(d(x2,x 3 ),d(x 3 ,xi),d(xi,X2)) 
we have proved that 

r(xi,x 2 ,x 3 ) < max{r(x 1 ,x 2 ),r(x2,x 3 ),r(x 3 ,xi)}. 

□ 

Remark 3.5. For k > Awe cannot give a nice expression ofr(x) as an explicit function ofW(x). 
It should be noted that the determination of the center y and thus ofr(x) cannot be performed by a 
sequence of local decisions at each coordinate or even at each subset of coordinates corresponding to 
each u; in other words property (Q]) ofd( ) does not hold for r and it makes it more difficult to study. 
However for k randomly chosen points, the distances of each point to the center y of the smallest ball 
containing them are expected to have about the same value, in other words the points are expected to 
be close to the border of the ball. When it is the case, the radius of the k points is approximated by a 
much nicer function, called the average radius (or moment of inertia), introduced in [5]. 

3.3. The average radial distance. 

Definition 3.6. The average radial distance or average radius (or moment distance or moment of 
inertia) ofk elements of H n is defined by: 

r(xi, . . . ,x k ) = min- ^ d(y,Xi). 



y k 

Ki<k 
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Proposition 3.7. The average radius has the properties: 

(i) r(x 1 ,x 2 ) = ^fl. 

(ii) For all permutation r of {1, ... , k}, r(xi, . . . , Xk) = r(x T tu, . . . , x T t k \). 

(iii) For all a € Aut(H n ), r{x\, . . . , x k ) = r(a(xi), . . . , <r(xfc)). In terms of the weight distri- 
bution W(x) = {n w (x)) weV k-i, 

f(x±, . . . , Xk) = t mm(wt(w),k — wt(w))n w (x) 

weF* 

(iv) For aZZ 

r(xi, . . • , > - ^ r(xi, Xi-i,Xi+i, ...,x k ) 

i=l 

(v) The above inequality is an equality for k = 1 mod 2. particular, we have 

r(xi,x 2 ) +r(x 2 ,x 3 ) +r(x 3 ,xi) 
r(xi,x 2 ,x 3 ) = 

(vi) For aZZ 

x 2(jfe-l)^_. 
r(xi,...,x fc ) < k ^ _ ^ 2^ r ( Xl ' • • ■ ,Xi-i,x i+1 ,. . .,x k ). 

(vii) "Triangular" inequality: for all y G H n , 

1 fe _ 

r(xi, ...,x k )< y- - ^2 r ( x i> • • • ' x i-l> 2/' ■ ■ ■ ) s fc)- 
1=1 

Proof. Properties (i) (ii) and the Aut(i? n )-invariance are trivial. 

If the j-th column of the matrix M(x) equals u € F§, the contribution of this column in ^ i Xi) 
is equal to wt(u) if yj = and to wt(u) if yj = 1. So the minimum of this sum over all y equals 

min(wt(u) , wt(u))n u (x) . 

u 

which leads to the formula announced in (iii). It also shows that 

n 

r(xi,...,x k ) = ^r{{xi)j,...,{x k )j). 

3=1 

Consequently, in order to prove the remaining assertions, we can assume n = 1. Let the weight of x 
be denoted by w. Without loss of generality we assume that either w < k/2 or w = k/2. This last 
case can only happen if k = mod 2. We prove (v) and (vi): in the case w < k/2, removing Xi = 1 
makes kr(x) drop by 1 while removing x,- L = does not change kf(x). In the case w = k/2, kr(x) 
always drops by 1. In other words, 



(k - l)r(xi, x^, x i+ i, ...,x k )= < 

and 



w — 1 if Xi = 1 and w < | 
w if Xi = and w < % 



w-1 if w = | 



(fc-l)gr(x 1> ...,a: i _ 1> x i+1 ,...,x fc ) = { ^ ^ ^ = | 
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We obtain 



l* r fix) if^^i 



2 

hence the inequalities 

(2fc -2) T ^ ~ ^^^ Xl '-'-' Xi - 1 ' Xi + 1 '---' Xfc ) 

If k = 1 mod 2, the case w = k/2 never happens so the second inequality is always an equality 
For the triangular inequality (vii), we find 

k 

ky~]r(xi, . . . ,Xi-i,y,x i+ i, ...,x k ) = 



i=i 



hence 



(k - l)w + k if w < L|J and y = 1 
( (k — l)w if w = | and y = 1 

few if w = and y = 1 

(fc — l)w ify = 

fe 

fcy^r(zi, . . .,Xi-i,y,x i+ i, . . . ,x k ) > (k - l)w = k(k - l)r(x). 



□ 



i=i 

3.4. Relationships between d, r, r. 

Proposition 3.8. The following hold: 

(i) For all x = (x\, . . . , x k ) € ii/^, 

— < f(x) < r(x) < d(x) 
k 

and 

r(x) < -d(x). 

(ii) For k = 2, 3, d(x) = kr(x). 

(hi) Ifr(x) = r(x) then the center of any of the balls of minimal radius r(x) containing the 
points (cci, . . . , Xk) is equidistant to these points. The converse is false, in the sense that the 
points may be equidistant to some y while r(x) < r(x). 

Proof. Since 

1 k 

]:^2 d (y> x i) <maxd(y,Xj), 

i=l 

we obviously have r(x) < r(x). From 

1 < mm(wt(w), k — wt(w)) < k/2 

for w / fc_1 and from the expressions given in Proposition 13.71 (hi) for r(x) and in Proposition [3? 
(hi) for d(x) we have 

1 _ 1 

—d(x) < r(x) < — d(x). 
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Let J be the set of coordinates where x„- € {0 fc , l k }. If we choose y such that yj agrees with (xi)j for 
j € J, then d(y, x{) < n — \ J\ = d(x). Thus r(x) < d(x). This concludes point (i). 
(ii) is obvious from previous formulas. 

Let us assume that r(x) = r(x) = r and let y be the center of a ball of radius r containing all Xj. 
Then we have the inequalities 

r = r(x) < — d(y, Xi) < maxdfy, xA = r 
k ' i 

i 

thus \ Yli d{y, Xi) = maxj d(y, Xi) which means that all d(y, x{) are equal to r. 

We build a counterexample with k = 3. If noi,nio,nn are even numbers, the points will be 
equidistant to some point y with y w = n w /2. We assume moreover that noi < nio < n\\. From 
Proposition 13 .4[ r(x) = (nio + nn)/2 and from Proposition 13.71 r(x) = (noi + nio + nn)/3 so if 
2noi < nio + ran we are done. □ 

4. Code invariants and their significance 

4.1. Code invariants. 

Definition 4.1. For any C C H n , and for f = d,r, r, we define 

fk-i(C) =min/(xi,...,x fc ) 
where the minimum is taken over all k-tuples ofpairwise distinct elements ofC. Moreover we define 

4 ff i(C) = mind^i, . .. ,x k ) 

where the minimum is taken over all k-tuples of ajfinely independent elements of C. Following stan- 
dard notation in coding theory, we let A^_i(n, /, m) be the maximal number of elements that a code 
C C H n can have under the condition fk-i(C) > m. 

Proposition 4.2. The following hold: 

(i) d\{C) = df^(C) is the Hamming distance of the code C. 

(ii) d 2 (C) = df(C). 

(iii) IfC is a linear code, and < k <2 t , 4_i(C) = df(C). 

(iv) IfC is a linear code, (i5^. 1 (C) coincides with the minimum (k — l)-th generalized weight as 
defined in ('19'], namely: 

df^iC) = mm{w{D) : D C C, D linear, dim(D) = k - 1}. 

where w(D) is the set of coordinates i at which at least one element of D is non zero. 

Proof. Obvious. □ 

Remark. The quantity dk(C) is more natural and easier to deal with than the more intricate d^(C). 
Unfortunately, d^{C) only coincides with the minimum fc-fh generalized weight of a linear code for 
k = 1, 2, hence the definition of d a ^{C), originally stated in Q. In yet another generalisation of 
the minimum fc-th generalized weight to non-linear codes is introduced that does not consider affinely 
independent sets of vectors. We will not dwell on the differences here and our study will mostly focus 
on the quantity dk (C) itself, of interest in its own right since it has a natural interpretation in terms of 
list decoding "radius" for lists of size k when decoding from erasures (see section I4T21 below). 

Proposition 4.3. For f = d, d aS , r, r and for any code C, 

A-i(c) < h{C) 



10 



CHRISTINE BACHOC, GILLES ZEMOR 



Proof. It follows from Propositions |3.2| (viii), |3.4| (iv) and |3.7| (iv) that for pairwise distinct x 

f(xi, . . .,x k+l ) > f k -i(C) 
respectively for affinely independent x 

d(x 1) ...,x k+1 )>d£_ 1 (C). 
Hence / fc _i(C) < f k {C). □ 

4.2. List decoding. A list decoding procedure is a decoding procedure that outputs a list of code- 
words. The length L of the list is determined in advance. This list is usually obtained by the enumer- 
ation of all codewords in a ball B(y, r). For a given code C, the associated value of r is known as the 
L-list decoding radius of C: 

Definition 4.4. The L-list decoding radius Rl(C) is the largest value ofr such that, for all y G H n , 

caxd(B(y,r) DC) < L. 

In the case L = 1, we recover the notion of the (unique) decoding radius of a code, R\(C) = 
l(d(C) — l)/2j. This number is also the largest value of r such that the balls of radius r centered at 
the codewords have the property that any L + 1 of them have an empty intersection. A set of balls with 
this property is called a L-multiple packing. Thus a classical packing of balls is a 1-multiple packing. 

Proposition 4.5. 

Rl(C) = r L (C) - 1. 

Proof. There exists (x\, . . . ,xl+i) G C L+1 and y € H n such that for all 1 < i < L + 1, X{ G 
B(y,r L (C)) and x { ^ xj thus caid{B (y,r L (C)) n C) = L + 1 and R L (C) < r L (C). Moreover, 
if r < ri{C), L + 1 codewords cannot be elements of the same ball of radius r thus Rl{C) = 
r L {C) - 1. □ 

The notion of list decoding can also be investigated in the framework of erasure decoding, see JTTJ. 

Definition 4.6. The L-list decoding radius for erasures R e £(C) is the largest value ofr such that, for 
all E C {1, . . . , n}, card(£) < r, and for any y = (y l ) i ^ E G {0, l} n ~ c ^ d ( E ) 

card({x G C : {xi)i^ E = v}) < L. 

The following proposition, which is a straightforward consequence of the definition of c?l, makes 
generalized distances relevant to erasure decoding ifTTl l20l l2TTl . 

Proposition 4.7. 

Rl(C) = d L {C) - 1. 



5. Upper bounds for d k , r k , r k . 

In this section we gather the analogs of the Singleton, Hamming, Plotkin and Elias bounds for 
f = d,r, r. The methods are well-known and some of the bounds may be found explicitely in the 
litterature, but not always in form precise enough for numerical computation (in particular only as- 
ymptotic versions of the Elias bounds can be found) which we need to compare them to the new SDP 
bounds. 
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5.1. The Singleton bound. This bound for d is the most elementary and is a natural generalisation 
of the classical Singleton bound for the ordinary Hamming distance. 

Proposition 5.1. Let C C H n . Then, if 4-1 (C) > 4-1 

|C| < (k-l)2 n ~ dk - 1+1 . 

Proof. Consider the restriction of the codewords on a fixed set of (n — 4—1 + 1) indices. The number 
of possible images is of course 2 n ~ dk ~ 1+1 . If \C\ > (k—l)2 n ~ dk - 1+1 , there is a subset of k codewords 
having the same image. Thus they have a generalized Hamming distance at most equal to 4-1 + 1 
and we have a contradiction. □ 

It is worth noticing that the Singleton bound for k = 3 is tight for d = 3 and for d = n. 

5.2. Hamming type bound. This volume type bound is established in [7] [Prop II.I] for 4-1 and 
for linear codes and generalized to the non-linear case in |2). We take the following notations: the 
number of elements of a ball of radius r in H n is denoted b" or b r if n is clear from the context. We 
recall the formula 

r / \ 

n 



k=0 

Proposition 5.2. Let C C H n . Then 

(i) Ifr k -i(C) > r fc _i or r fc _i(C) > r fc _i then 

- 1)2 



\C\ < 



un 



(ii) 7/d fc _i(C) > 4-i then 



(iii) Ifdf_ x (C) > 4_i then 



\c\< (k ~ 1)2n 



h n 



r>n+A;— 2 

\c\ < 



Proof, (i) If rfc_i(C) > r^-i or Ffe_i(C) > r^^i, from Proposition I3.8l (i) and Proposition 14.51 we 
have, for all y G H n , card(i?(y, r^-i — 1) D C) < k — 1. In order to establish the announced 
inequality, we count in two ways the elements of 

E := {(c,y),c eC,y e H n : d(c,y) < r k _ x - 1}. 

We have 

card(E') = cardjy G F n : d(y,c) < r fe _i - 1} 

cgC 

= 

= cardjc € C : d{y,c) < r k -\ — 1} 
yeH n 

< card(H n )(k - 1) = (jfc - 1)2™. 
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(ii) If dfe-i(C) > 4-i» from Proposition [X8](i) we have r k _i{C) > 1"%^] thus we can apply the 
previous result. 

(hi) Let (xi, . . . , x k ) S C k be affinely independent and let y £ H n . We have 



4-1 < d( Xl ,...,x k ) < 7— j- ^ d(xi,Xj) 

l<i<j<k 



l<i<j<k 

< ^2 d (xi,y). 



l<i<k 

Thus for some i, 1 < i < k, d(xi,y) > {^E^]. Since any subset of H n with at least 2 k ~ 2 + 1 
elements contains k affinely independent ones, we have for all y £ H n , 

caT d(B(y, r^i]-l)nC7)<2 fe - 2 . 

and we follow the same line as in (i). □ 

5.3. Plotkin type bound. This type of bound is usually derived from the estimate of the average 
value of / among C k . This average value can be estimated when / can be calculated from its value at 
each coordinate, which is the case for f = d,r. 

We take the following notations: let C be a binary code with M elements; let Wj be the number of 
ones in the j-th column of the M xn matrix whose rows are the M elements of C. Let J k {C), respec- 
tively J£ ff (C) be the set of A>tuples of pairwise distinct, respectively affinely independent codewords. 
We moreover define 



jk{x) :-- 

and 



if x < k - 1 



if (x) 



ifx<2 k ~ 2 
xUt=o( x ~ 2t ) Hx>2 k ~ 2 . 
We have obviously \J k (C)\ = jk(M) and | (C) | > jjf(M), this last inequality being an equality 
if C is linear. For x € R, we also denote as is usual (?) := j k (x)/k\. 

Proposition 5.3. With the above notations: 
(i) lfd k _i(C) > 4_i then 

4-i . n ( M / 2 ) 



< 1 -2- 



(?) 

(ii) C linear or ifk = 3, and if d^ 1 (C) > 4-i> we /iave 

. 4-1 , 1 v M 



(iii) Ifr k -i(C) > r fc _i ?/?en 



n 
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Proof, (i) For the generalized Hamming distance, we have 

n 

x£j k (C) x£j k (C) j=l 

n 

= X)( Y d (( x i)i> ■ ■ ■ ' ( x k)j)) 

3=1 xd.J k {C) 
n k—1 



j=l i=l 
(M-x\ . ( x \ (M-x\ 



u>i\ I M — Wj 
k — i 



The function x —> [T) ( ) + {j^-i) ( % X ) * s concave an d invariant by x — > M — x thus it takes its 
maximum at x = M/2. We derive the inequalities: 

(ii) In the special case k = 3, we obtain from (i) the desired inequality. In the case C linear, 
we observe that Wj = 0, M, M/2 and that d{(x\)j, . . . , (xk)j) is non zero only if Wj = M/2 and 
x\, . . . , Xk do not all belong to {x G C : Xj = 0} or to {x € C : Xj = 1}, which have M/2 elements. 
Thus 

jf (M)4-i < n{jf{M) - 2jf (M/2)) 

hence the announced inequality. 

(iii) The result for r is derived similarly to the result (i) in the d case. □ 

Remark 5.4. The upper bounds established in Proposition \5.3\ can easily be turned into upper bounds 
for M = \C\. Indeed, if (j>k-i '■= fk-i/ n < A(M) / B(M) where A and B are polynomials of the 
same degree, with respective leading coefficients a and (3, with B(M) > 0, then, if <pk-i ^ a /P< M 
is upper bounded by the largest zero of the polynomial 4>k_iB(M) — A(M). The bound obtained this 
way holds for 5^ > 1 - l/2 fe - 1 and p k -i > 1/2 - ( L(fc *1) /2 j ) /2*- 

5.4. The Elias-Bassalygo technique and constant weight codes. We recall that Ak-i(n, f, m) de- 
notes the maximal number of elements that a code C C H n can have under the condition fk-i(C) > 
m; analogously let Ak-i(n, w, f, m) be the maximum among the codes with constant weight w. With 
a standard argument, the following inequality holds: 

^ A k _i(n,f,m) < A k _i(n,w,f,m) 



card(ff n ) ~ card(J^) 

where J™ is the set of the (™) binary words of length n and weight w. This so-called Elias Bassalygo 
technique is expected to improve the bounds on H n , if similar bounds are established on the Johnson 
spaces J™. Note that the value of w on the right hand side can be chosen freely. This line was 
followed in [7 ] for the generalized Hamming distance, and required moreover to extend the methods 
to non linear codes. In view of Q, we work out Plotkin type bounds for constant weight codes: 

Proposition 5.5. Let C C J™ have M elements and let ui := w / n. 

(i) Ifd k - X (C) > 4_i then 

d k~l — — ^ 1 7m 



n 



( M k) 
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(ii) Vdf^C) > 4_i, we have 



n jf(M)\ (*) 
(iii) 7jfr fe _i(C) > r fc _i f/ien 



(l) 

Proof. For d and r, we follow the same line as for the proof of Proposition 15.31 There we applied 
the inequality Y^j=\ 9( w j) ^ ng(M/2) for relevant functions g, being concave and invariant by 
x — > M — x. Since C C J™, we have Y^j=i w j = Mw, so we can instead use the stronger inequality 

Y!j=i9{wj) < ng(Mw/n). □ 

6. The SDP bound for d 2 , r 2 , r 2 . 



The method developed in [15] can be used to derive upper bounds for the cardinality of a binary 
code C with given d 2 {C) (respectively r 2 (C), r 2 (C)). Recall that d 2 (C) > d if and only if, for 

all (x, y, z) G C 3 such that x ^ y, y ^ z, d(x, y) + d(y, z) + d(z, x) > 2d , z ^ x (respectively 
r 2 (C) > riff max( \^-], \^-], \^-]) > r and r 2 (C) >rffid(x,y)+d(y,z)+d(z,x) > 6r 
for all (x, y, z) G C 3 ). 

The SDP constraints at work in [15 ] are exactly SDP constraints on triples of points. In order to 
describe these constraints we adopt the group theoretic point of view of (H, iTTTIl . lfl8l . 

Let X := H n and, for all k := . . . n, the so-called Johnson spaces := {x, x £ X : wt(x) = 
k}. We consider the action of the symmetric group S n on H n . The Johnson spaces Xf. are exactly 
the orbits of this action. Now we consider the decomposition of the functional space L 2 (X) = M. x of 
real valued functions on X under the action of S n . The space R x is endowed with the ^-invariant 
scalar product 



(/,<?) = E 



\x\ 

1 1 xex 



We have the obvious decomposition into pairwise orthogonal 5 n -invariant subspaces: 

R x = R x i 1 R x i _L • • • _L R Xn . 
The decomposition of R Xk into S n -irreducible subspaces is described in [10]. We have 

R Xfc = H ^ -L U\.k -L • • • -L -ff m in(fc,ri-fc),fc 

where the are pairwise isomorphic for fixed i and pairwise non isomorphic for fixed k. The 
picture looks like: 

R x = M Xl _L R Xl J_ ... ± M X Lf J ± ... _L M^"- 1 _L R x " 

Ho,o -L Ho,i -L ••• -L -ffo, |_f J ••• -L -Ho,n-i -L -^o,n 
■ffl.i -L • • • -L Hi n -i 



LfJ-LfJ 
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where the columns represent the decomposition of M. Xk and the rows the isotypic components of R x , 
with multiplicity n — 2k + 1, i.e. we have for < k < [f J, 

TT I TT I I TT ^ Tjn — 2k+l 

"■k,k -L tlk,k+l -L ■ ■ ■ -L tlk,n-k — t± kk 

To each of these isotypic components, indexed by k, for < k < [§J , we associate a matrix E k 
of size n — 2k + 1 as explained in J3), indexed with s, t subject to k < s, t < n — k, in the following 
way: Let (e fcjM , e kA2 , • • • , ek,k,h k ) be an orthonormal basis of H^ k and let e kySj = ip kyS (e kAj ). Tne 
application ip kyS is defined by: 

tp kjS : R Xk R Xk 

f !-> -0fe,s(/) : ^k,s(f)(y) = T,wt(x)=kf(x) 

xCy 

and has the property to send and orthonormal basis of H kk to an orthogonal basis of H k s , the 
elements of this basis having constant square norm equal to ("it)- The (s,t) coefficient of E k is 
defined by: 

-. h k 

Ek, s ,t(x,y) = -r-y^e kiS: j(x)e k ,t,j{y)- 

3=1 

From O, E kySyt (x, y) = E k>Sjt (gx, gy) for all g G S n . Thus for A; < s < t < n — k, we can define 
Pk,s,t by E kySyt (x,y) = P kySy t(s — \x fl y|). It turns out that these P k)S ,t express in terms of Hahn 
polynomials. 

The Hahn polynomials associated to the parameters n, s, t with < s < t < n are the polynomials 
Q k (n, s, t; x) with < k < min(s, n — t) uniquely determined by the properties: 

(i) Q k has degree k in the variable x 

(ii) They are orthogonal polynomials for the weights 

< i < s w(n,s,t:i) = ( | ( | 
" " v ' ' ' ' \ij\t-s + i) 

(hi) Q fc (0) = 1 

The combinatorial meaning of the above weights is the following: 

Lemma 6.1. Given x € X k , the number of elements y G Xt such that \x n y\ = s — i is equal to 

w(n, s, t; i). 

Finaly we have: 

Proposition 6.2. Ifk < s < t < n — k, wt(x) = s, wt(y) = t, 

^t-fc^ ^n-2k^ 

Ek, s ,t{x,y) = \X\ S ~/ w wm — Q k {n,s,t;s - \x(ly\) 
it) \s) 

Ifwt(x) ^ sor wt(y) ^ t, E k , s ,t{x, y) = 0. 

By the construction, the matrices E k satisfy the semidefinite positivity properties: 
Theorem 6.3. For all k, < k < \_\\,forallC C H n , 

Ek(c,c)hO. 

(c,c')ec 2 
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These constraints are not interesting for pairs because they are not stronger than the positivity 
properties from Delsarte method. They are only interesting if triples of points are involved: namely 
we associate to (x,y, z) G iT 3 the matrices 

F k (x,y,z) := E k (x - z,y - z). 
We have for all C C H n , and for all z G H n , 

F k (c,c',z)yO 
which leads to the two positive semidefinite conditions: 

,~ J E( c , cV " )e c3^(c,c',c")>=0 

From Proposition 16.21 E k (x — z,y — z) only depends on the values of wt(x — z), wt(y — z), 
wt(x — y); so with a := d(y, z), b := d(x, z), c := d(x, y), we have for some matrices T k (a, b, c), 

F k (x,y,z) = T k (a,b,c). 
We introduce the unknowns x a ,b,c of the SDP. Let, for 

a + b + c = mod 2 
a + b + c < 2n 

(a,b,c) G SI := {(a,b, c) G [0 . . . n] 3 : c<a + b > 

b < a + c 
a <b + c 

x a ,b,c ■= j^7| card{(x, y, z) G C 3 : d(y, z) = a, d(x, z) = b, d(x, y) = c}. 

Note that 

1 „ 
Xo,c,c = j^j card{(x, y) G C : d(x, y) = c}. 

With the definition 

t(a, b, c) := card{z G H n : d(x, z) = b and d(y, z) = a} for d(x, y) = c 
= (-)(n)wherea-6 + c = 2i 

the following inequalities hold for x a i c : 

(i) ^0,0,0 = 1 

(ii) x aAc > 

(iii) x aAc = cc r ( ) jr ( 6 ) !T ( c ) for every permutation r of {a, b, c} 

(iv) x a ,b,c < t(a, b, c)x , CjC 

(v) x a>b)C < t(b, c, 

(vi) x aj 6 )C < t(c, a, 6)x ,b,6 

(vii) Ea b c T k(a, b, c)x aAc h for all < k < L§J 

(viii) Ea,b,c T k(a, b, c)(t(a, b, c)x ^ c - x aAc ) h for all < k < L§J 

where conditions (vii) and (viii) are equivalent to (f3]). To the above semidefinite constraints we add the 
extra condition (ix) that translates the assumption that ^(C) > d for some given value d (respectively 

(C) > r, ¥2(0) > r), namely 
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(ix) d x aAc = if abc ^ and a + b + c < 2(d - 1) 
respectively 

(ix) r x at b,c = if abc / and a + b + c < 6r 

or 

(ix) r s a , b , c = if a&c ^ and max( [§] , [§] , [§] ) < r - 1. 
It remains to notice that 

(X) |C| = Ec^O^c- 

Thus an upper bound on \C\ is obtained with the optimal value of the program that maximizes 
S c x o,c,c under the constraints (i) to (ix). 

It is worth noticing that the conditions (ix) can be replaced by any other conditions of the type 

(ix*) x a ,b,c = if (a, b,c) El 
where / is a set of forbidden values in C related to some other situation. In the classical case treated 
in Gil, dt(C) >S,I = {(a,b,c) : a or 6 or c € [1 . . . (5 - l)] 3 }. 

7. Numerical results 

In this section we compare the SDP bounds obtained for A 2 (n, d, m) and for A 2 (n, r, m) with the 
previously known bounds, stated in Section [5] We recall the obvious values A2(n,d, 3) = 2 n , 
A 2 (n,d,n) = 4, A 2 (n,r,l) =2 n ,A 2 (n,r, [n/2\) =4. 

Table Q] gives two upper bounds for A 2 (n, d, m): one is the tightest of the combinatorial bounds 
of section [51 with a superscript 1, 2, 3, 4 denoting which of the four methods, Singleton, Hamming, 
Plotkin, Elias (respectively) achieves this best, and the other is the bound obtained by the SDP method 
of Section [6] As we can see, in the non-trivial cases the SDP bound gives a substantial improvement 
almost all the time. 

For the radius r, we can restrict ourselves to codes in which the pairwise distances are even. Let 
us denote A^(n, r, m) the maximal number of elements of such a code with minimal radius at least 
equal to m; then one easily sees that A 2 (n,r, m) = A^n + l,r,m), with the standard extension 
of an optimal code to an even code with an extra coordinate. Table [2] compares the best bound for 
j4^~(n, r, m) (in italics) given by the combinatorial methods of Section|5jto the SDP bound. Again we 
have improvements in almost every instance. 
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